VIDEO DATA COMPRESSION APPARATUS AND METHOD OF SAME



BACKGROUND OF THE INVENTION 
1. Field of the Invention

The present invention relates to a video data
compression apparatus for compressing and coding
noncompressed video data and a method of same.
2. Description of the Related Art 
When compressing and coding noncompressed
digital video data in units of GOPs (groups of pictures)
comprised of an I picture (intra-coded picture), a B
picture (bi-directionally predictive coded picture), and a
P picture (predictive coded picture) by a method such as
MPEG (Moving Picture Experts Group) and recording the
same on a recording medium such as a magneto-optical disc
(MO disc) or transmitting the same via a communication line,
it is necessary to control the amount of data (amount of
bits) of the compressed video data after the compression
and coding to be not more than the recording capacity of
the recording medium or not more than the transmission
capacity of the communication line while keeping the
quality of the image after expansion and decoding high.

For this purpose, first, there is adopted a
method of preliminarily compressing and coding the
noncompressed video data to estimate the amount of data
after the compression and coding (first pass), then
adjusting the compression rate based on the estimated
amount of data and performing the compression and coding
so that the amount of data after the compression and
coding becomes not more than the recording capacity of the
recording medium (second pass) (hereinafter, such a
compression and coding method will be also referred to as
"two pass encoding").

However, if performing the compression and
coding by the two pass encoding, it is necessary to apply
similar compression and coding processing two times with
respect to the same noncompressed video data, so a long
time is taken. Further, since the final compressed video
data cannot be calculated by one compression and coding
processing, the shot video data cannot be compressed,
coded, recorded, or transmitted in real time as it is.

SUMMARY OF THE INVENTION 

An object of the present invention is to provide a
video data compression apparatus with which the audio
and/or video data can be compressed and coded to a
predetermined amount of data or less without using the two
pass encoding and a method of the same.

Further, another object of the present invention is to
provide a video data compression apparatus with which the
video data can be compressed and coded in almost real
time and in addition with which a high quality image can
be obtained after the expansion and decoding and a method
of the same.

Further, still another object of the present 
invention is to provide a video data compression 
apparatus with which the compression and coding 
processing can be carried out by adjusting the 
compression rate by estimating the amount of data after the 
compression and coding not by the two pass encoding and a 
method of the same. 

According to a first aspect of the present 
invention, there is provided a video data compression 
apparatus having an indicator data calculating means for 
calculating indicator data indicating a complexity of 
video data for every picture from noncompressed video 
data; a target value calculating means for calculating a 
target value of an amount of data after compression of 
the video data for every picture based on the calculated 
indicator data; and a compressing means for compressing the 
noncompressed video data so that the amount of data after 
compression becomes the calculated target value. 

According to a second aspect of the present 
invention, there is provided a data compression method 
comprising the steps of calculating indicator data
indicating a complexity of video data for every picture
from noncompressed video data; calculating a target value 
of an amount of data after compression of the video data 
for every picture based on the calculated indicator data; 
and compressing the video data by a predetermined 
compression method so that the amount of data after 
compression becomes the calculated target value. 

According to a third aspect of the present 
invention, there is provided a video data compression
apparatus having an indicator data calculating means for 
calculating indicator data indicating a complexity of 
video data for every picture; a difficulty data 
calculating means for performing a predetermined 
computation processing for multiplying a coefficient with 
the calculated indicator data to calculate difficulty 
data corresponding to the amount of data after 
compression of the video data; a target value calculating 
means for calculating a target value of the amount of 
data after compression of the video data for every 
picture based on the calculated difficulty data; a 
compressing means for compressing each of the pictures of 
the video data by a predetermined compression method so 
that the amount of data after compression becomes the 
calculated target value so as to generate compressed 
video data; and a coefficient updating means for updating
the coefficient based on the amount of data of the
generated compressed video data. 

According to a fourth aspect of the present 
invention, there is provided a video data compression 
method comprising the steps of calculating indicator data 
indicating a complexity of video data for every picture; 
performing predetermined computation processing for 
multiplying a coefficient with the calculated indicator 
data to calculate difficulty data corresponding to the 
amount of data after compression; calculating a target 
value of the amount of data after compression of the 
noncompressed video data for every picture based on the 
calculated difficulty data; compressing each of the 
pictures of the video data by the compression method so 
that the amount of data after compression becomes the 
calculated target value so as to generate compressed 
video data; and updating the coefficient based on the 
amount of data of the generated compressed video data. 

According to a fifth aspect of the present 
invention, there is provided a video data compression 
apparatus for compressing a continuous plurality of video 
data to compressed video data of a picture type sequence 
containing a plurality of types of pictures (I picture, P 
picture, and B picture) in a predetermined order, having a 
rearranging means for rearranging pictures of the
noncompressed video data to an order adapted to the
compression method so that each head picture of the video
data becomes an I picture or a P picture; an indicator
data calculating means for calculating indicator data
indicating a complexity of the rearranged noncompressed
video data for every picture; a border detecting means
for detecting a scene change of a continuous plurality of
the noncompressed video data; a changing means for
changing the picture type sequence so that pictures of
any of the noncompressed video data are compressed
without reference to the pictures of the other
noncompressed video data for every border of a detected
scene change; a target value calculating means for
calculating a target value of the amount of data after
compression of the video data based on the calculated
indicator data and the picture type sequence after
change; and a compressing means for compressing the video
data to compressed video data of the picture type
sequence after change so that the amount of data after
compression becomes substantially the calculated target
value.

According to a sixth aspect of the present 
invention, there is provided a video data compression 
method for compressing a continuous plurality of video 
data to compressed video data of a picture type sequence
containing a plurality of types of pictures (I picture, P
picture, and B picture) in a predetermined order,
comprising the steps of rearranging pictures of the 
noncompressed video data to an order adapted to the 
compression method so that each head picture of the video 
data becomes an I picture or a P picture; calculating 
indicator data indicating a complexity of the rearranged 
noncompressed video data for every picture; detecting a 
scene change of the continuous plurality of the 
noncompressed video data; changing the picture type 
sequence so that a picture of any of the noncompressed 
video data is compressed without reference to a picture
of other noncompressed video data for every border of a 
detected scene change; calculating a target value of the 
amount of data after compression of the video data for 
every predetermined compression unit based on the 
calculated indicator data and the picture type sequence 
after change; and compressing the video data to the 
compressed video data of the picture type sequence after 
change so that the amount of data after compression 
becomes substantially the calculated target value. 

Note that flatness is defined as an indicator
representing a spatial flatness of an image, represents
the complexity of the image, and has a correlation with the
difficulty (degree of difficulty) of the pattern of the
image and the amount of data after compression.

Further, an intra AC is defined as the sum of the
dispersion values of the average value of the video data
in the pictures and the video data for every DCT block of
the DCT processing unit in the MPEG system, represents
the complexity of the image, and has a correlation with
the difficulty (degree of difficulty) of the pattern of
the image and the amount of data after compression
similar to the flatness.

Further, an ME residual is defined as the sum of
absolute values or the sum of square values of the
difference of the video data between the input picture
and a reference picture after motion compensation
processing by the motion vector at the compression and
coding, represents the speed of the motion of the image
and the complexity of the pattern, and has a correlation
with the difficulty (degree of difficulty) of the pattern
of the image and the amount of data after compression
similar to the flatness.

Further, the activity is defined as the activity
used for determining a quantization value of quantization
processing with respect to a macroblock in TM5 [test
model 5; ISO/IEC JTC/SC29 (1993)], well known as a compression
algorithm of the MPEG system.
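
As a rough illustration of the intra AC and the ME residual defined above, the following sketch computes both for small blocks of luminance samples. It assumes that the "dispersion value" of a DCT block is the sum of absolute differences between each sample and the block average, and it takes the motion-compensated reference picture as already given. The function names are hypothetical and do not appear in the specification.

```python
def intra_ac(block):
    """Sum of absolute deviations of one DCT block from its own average.

    Assumption: this is one reading of the "dispersion value" of a block.
    """
    samples = [p for row in block for p in row]
    mean = sum(samples) / len(samples)
    return sum(abs(p - mean) for p in samples)


def picture_intra_ac(blocks):
    """Intra AC of a picture: the per-block values summed over all blocks."""
    return sum(intra_ac(b) for b in blocks)


def me_residual(current, predicted, squared=False):
    """Sum of absolute (or squared) differences between the input picture
    and the reference picture after motion compensation."""
    total = 0
    for row_c, row_p in zip(current, predicted):
        for c, p in zip(row_c, row_p):
            d = c - p
            total += d * d if squared else abs(d)
    return total
```

A flat block gives an intra AC of zero, and a picture that is predicted perfectly gives an ME residual of zero, which matches the description of both values as indicators of complexity.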
BRIEF DESCRIPTION OF THE DRAWINGS



These and other objects and features of the present 
invention will become clearer from the following 
description of the preferred embodiments given with 
reference to the attached drawings, in which: 

Fig. 1 is a view of the configuration of a video 
data compression apparatus according to the present 
invention; 

Fig. 2 is a view of the configuration of an encoder
of a simplified two pass processing unit shown in Fig. 1;

Fig. 3 is a view of the configuration of the encoder 
shown in Fig. 1; 

Figs. 4A to 4C are views of an operation of the 
simplified two pass encoding of the video data 
compression apparatus according to a first embodiment of 
the present invention;

Fig. 5 is a schematic view of the configuration of 
the video data compression apparatus according to a 
second embodiment of the present invention; 

Fig. 6 is a detailed view of the configuration of a 
compression and coding unit of the video data compression 
apparatus shown in Fig. 5; 

Fig. 7 is a view of the correlation between an ME
residual and real difficulty data when producing a P
picture by the video data compression apparatus shown in
Fig. 1 and Fig. 5;

Fig. 8 is a view of the correlation between the ME
residual and the real difficulty data when producing a
B picture by the video data compression apparatus shown
in Fig. 1 and Fig. 5;

Fig. 9 is a view of a method of calculation of
flatness;

Fig. 10 is a view of the correlation between the
flatness and the real difficulty data when producing
an I picture by the video data compression apparatus shown
in Fig. 1 and Fig. 5;

Fig. 11 is a view of the correlation between the
flatness and the real difficulty data when producing
an I picture by the video data compression apparatus shown
in Fig. 1 and Fig. 5;

Fig. 12 is a view of a compression and coding
operation of the video data compression apparatus (Fig.
6) according to a third embodiment of the present
invention;

Fig. 13 is a view of the content of the processing
of the host computer (Fig. 6) of the video data
compression apparatus 2 according to the third embodiment
of the present invention;

Fig. 14 is a view of the compression and coding
operation of the video data compression apparatus
according to a fourth embodiment of the present invention
in an order of coding;

Figs. 15A to 15C are views of processing for
rearrangement of the order of the pictures of the edited
video data by an encoder control unit (Fig. 1) and
processing for changing the picture type by the host
computer; and

Fig. 16 is a flowchart of the content of the
processing of the host computer of the video data
compression apparatus according to a fourth embodiment of
the present invention.

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

First Embodiment 

Below, a first embodiment of the present invention 
will be explained. 

If a plurality of video data having a high coding 
difficulty such as a pattern having a large number of 
high frequency components or a pattern having a large 
amount of motion are compressed and coded by a 
compression and coding system of video data such as the 
MPEG system, generally distortion accompanying the 
compression is apt to occur. For this reason, it is 
necessary to compress and code video data having a high 
difficulty with a low compression rate and allocate a 
larger target amount of data with respect to the 
compressed video data obtained by compressing and coding
the data having a high difficulty in comparison with
compressed video data of video data of a pattern having a
low difficulty.

In order to adaptively allocate the target amount 
of data with respect to the difficulty of the video data 
in this way, the two pass encoding system indicated as the 
related art is effective. However, the two pass encoding 
system is not suited to real time compression and coding. 

The simplified two pass encoding system indicated as 
the first embodiment was devised so as to solve the 
disadvantage of such a two pass encoding system. It can 
calculate the difficulty of the noncompressed video data
from the difficulty data of the compressed video data 
obtained by preliminarily compressing and coding the 
noncompressed video data and adaptively control the 
compression rate of the noncompressed video data delayed 
by exactly a predetermined time by an FIFO memory or the 
like based on the difficulty calculated by the 
preliminary compression and coding. 

Figure 1 is a view of the configuration of a video 
data compression apparatus 1 according to the present 
invention. 

As shown in Fig. 1, the video data compression
apparatus 1 is constituted by a compression and coding
unit 10 and a host computer 20. The compression and coding
unit 10 is constituted by an encoder control unit 12, a
motion detector 14, a simplified two pass processing unit
16, and a second encoder 18. The simplified two pass
processing unit 16 is constituted by an FIFO memory 160
and a first encoder 162.

The video data compression apparatus 1 realizes the
above simplified two pass encoding with respect to the
noncompressed video data VIN input from an external
apparatus (not illustrated) such as an editing apparatus
and a video tape recorder apparatus by these constituent
parts.

In the video data compression apparatus 1, the host
computer 20 controls the operation of the constituent
parts of the video data compression apparatus 1. Further,
the host computer 20 receives the amount of data of the
compressed video data generated by preliminarily
compressing and coding the noncompressed video data VIN
by the encoder 162 of the simplified two pass processing
unit 16, the value of the direct current component (DC
component) of the video data after the DCT processing,
and the power value of the alternating current component
(AC component) via a control signal C16 and calculates
the difficulty of the pattern of the compressed video
data based on these received values. Further, the host
computer 20 allocates the target amount of data T_j of the
compressed video data generated by the encoder 18 via the
control signal C18 for every picture based on the
calculated difficulty, sets the same in the quantization
control circuit 180 (Fig. 3) of the encoder 18, and adaptively
controls the compression rate of the encoder 18 in units
of pictures.

The encoder control unit 12 notifies the existence
of a picture of the noncompressed video data VIN to the
host computer 20 and further performs the preprocessing
for the compression and coding for every picture of the
noncompressed video data VIN. That is, the encoder
control unit 12 rearranges the input noncompressed video
data in the order of coding, performs picture field
transformation, performs 3:2 pull down processing
(processing for transforming the video data of 30
frames/sec to video data of 24 frames/sec), etc. when the
noncompressed video data VIN is the video data of a movie
and outputs the same as the video data S12 to the FIFO
memory 160 and the encoder 162 of the simplified two pass
processing unit 16.

The motion detector 14 performs the detection of the 
motion vector of the noncompressed video data and outputs 
the same to the encoder control unit 12 and the encoders 
162 and 18. 

In the simplified two pass processing unit 16, the
FIFO memory 160 delays the video data S12 input from the
encoder control unit 12 by exactly the time in which, for
example, L (L is an integer) pictures of the
noncompressed video data VIN are input and outputs the
same as the delayed video data S16 to the encoder 18.
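
The delay of L pictures can be pictured with a short sketch. This is only a software model of the behavior described for the FIFO memory 160, written under the assumption that pictures arrive one at a time; the class name `PictureFifo` and its `push` method are hypothetical.

```python
from collections import deque


class PictureFifo:
    """Delays a stream of pictures by a fixed number of pictures."""

    def __init__(self, delay):
        self.delay = delay      # L, the number of pictures of delay
        self.queue = deque()

    def push(self, picture):
        """Store the newest picture and return the one that entered
        `delay` pictures earlier, or None while the memory is filling."""
        self.queue.append(picture)
        if len(self.queue) > self.delay:
            return self.queue.popleft()
        return None


# With L = 3, picture 0 leaves the memory when picture 3 enters it.
fifo = PictureFifo(3)
delayed = [fifo.push(j) for j in range(6)]
```

In this model the picture leaving the memory is always L pictures behind the picture entering it, which is the relationship between the delayed video data S16 and the video data S12.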

Figure 2 is a view of the configuration of the 
encoder 162 of the simplified two pass processing unit 16 
shown in Fig. 1.

The encoder 162 is a compression and coding unit for
video data constituted by, for example, as shown in Fig.
2, an adder circuit 164, a DCT circuit 166, a
quantization circuit (Q) 168, a variable length coding
circuit (VLC) 170, an inverse quantization circuit (IQ)
172, an inverse DCT (IDCT) circuit 174, an adder circuit
176, and a motion compensation circuit 178. It compresses
and codes the input video data S12 by the MPEG system or
the like and outputs the amount of data etc. for every
picture of the compressed video data to the host computer
20.

The adder circuit 164 subtracts the output data of
the adder circuit 176 from the video data S12 and outputs
the result to the DCT circuit 166.

The DCT circuit 166 performs discrete cosine
transformation (DCT) processing on the video data input
from the adder circuit 164 in units of DCT blocks
consisting of, for example, 8 pixels x 8 pixels, transforms
the same from data of the time domain to data of the
frequency domain, and outputs the result to the
quantization circuit 168. Further, the DCT circuit 166
outputs the value of the DC component of the video data
after DCT and the power value of the AC component to the
host computer 20.
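
As an illustration of the two values reported to the host computer 20, the sketch below computes an 8 x 8 DCT directly from its definition and then separates the DC coefficient from the AC coefficients. It assumes an orthonormal DCT-II and assumes that the "power value" of the AC component means the sum of the squared AC coefficients; the specification does not give either detail, and the function names are hypothetical.

```python
import math

N = 8  # DCT block size in pixels


def dct_2d(block):
    """Orthonormal 2-D DCT-II of an N x N block (direct, unoptimized form)."""
    def scale(k):
        return math.sqrt(1 / N) if k == 0 else math.sqrt(2 / N)

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            s = 0.0
            for x in range(N):
                for y in range(N):
                    s += (block[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            out[u][v] = scale(u) * scale(v) * s
    return out


def dc_and_ac_power(block):
    """Return the DC coefficient and the sum of squared AC coefficients."""
    coeffs = dct_2d(block)
    dc = coeffs[0][0]
    ac_power = sum(coeffs[u][v] ** 2
                   for u in range(N) for v in range(N)
                   if (u, v) != (0, 0))
    return dc, ac_power
```

A uniform block has essentially no AC power, while a block with detail moves energy into the AC coefficients, so the AC power is a natural input for judging the difficulty of a pattern.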

The quantization circuit 168 quantizes the data of 
the frequency domain input from the DCT circuit 166 by 
the fixed quantization value Q and outputs the result as 
the quantized data to the variable length coding circuit 
170 and the inverse quantization circuit 172. 

The variable length coding circuit 170 performs 
variable length coding on the quantized data input from 
the quantization circuit 168 and outputs the amount of data 
of the compressed video data obtained as a result of the 
variable length coding via the control signal C16 to the 
host computer 20. 

The inverse quantization circuit 172 performs 
inverse quantization on the quantized data input from the 
quantization circuit 168 and outputs the same
as the inverse quantized data to the inverse DCT circuit 
174. 

The inverse DCT circuit 174 performs inverse DCT
processing on the inverse quantized data input from the
inverse quantization circuit 172 and outputs the result
to the adder circuit 176.

The adder circuit 176 adds the output data of the 
motion compensation circuit 178 and the output data of 
the inverse DCT circuit 174 and outputs the result to the 
adder circuit 164 and the motion compensation circuit 178. 

The motion compensation circuit 178 performs the 
motion compensation processing based on the motion vector 
input from the motion detector 14 on the output data of 
the adder circuit 176 and outputs the result to the adder 
circuit 176. 

Figure 3 is a view of the configuration of the 
encoder 18 shown in Fig. 1. 

As shown in Fig. 3, the encoder 18 is configured as
the encoder 162 shown in Fig. 2 plus the quantization
control circuit 180. The encoder 18 applies the motion
compensation processing, DCT processing, quantization
processing, and variable length coding processing with
respect to the delayed video data S16 delayed by the
amount of L pictures by the FIFO memory 160 by these
constituent parts based on the target amount of data T_j
set from the host computer 20 to generate the compressed
video data VOUT of the MPEG system or the like and
outputs the result to an external apparatus (not
illustrated).




In the encoder 18, the quantization control circuit
180 successively monitors the amount of data of the
compressed video data VOUT output by the variable length
coding circuit 170 and successively adjusts the
quantization value to be set in the quantization
circuit 168 so that the amount of data of the compressed
video data finally generated from the j-th picture of the
delayed video data S16 approaches the target amount of
data T_j set from the host computer 20.
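
The feedback performed by the quantization control circuit 180 can be sketched as follows. This is not the circuit of the embodiment but a minimal proportional rule chosen for illustration: after each macroblock, the bits spent so far are compared with the share of T_j that would have been spent at a uniform pace, and the quantization value is stepped up or down by one. The function name, the step size, and the limits 1 to 31 are assumptions.

```python
def adjust_quantizer(q, bits_spent, target_bits, done, total,
                     q_min=1, q_max=31):
    """Return an updated quantization value after `done` of `total`
    macroblocks of the current picture have been coded.

    Assumption: a uniform spending pace across the picture is the goal.
    """
    expected = target_bits * done / total
    if bits_spent > expected:
        q += 1   # coarser quantization, fewer bits from here on
    elif bits_spent < expected:
        q -= 1   # finer quantization, more bits from here on
    return max(q_min, min(q_max, q))
```

For example, with a target of 1000 bits and half of the macroblocks coded, having spent 600 bits raises the quantization value by one step, and having spent 400 bits lowers it by one step.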

Further, the variable length coding circuit
170 outputs the compressed video data VOUT to the outside
and in addition outputs the actual amount of data of
the compressed video data VOUT obtained by compressing
and coding the delayed video data S16 via the control
signal C18 to the host computer 20.

Below, an explanation will be made of the simplified 
two pass encoding operation of the video data compression 
apparatus 1 in the first embodiment. 

Figures 4A to 4C are views of the operation of the
simplified two pass encoding of the video data
compression apparatus 1 in the first embodiment.

The encoder control unit 12 performs the
preprocessing of rearrangement of pictures in the order
of coding etc. with respect to the noncompressed video data
VIN input to the video data compression apparatus 1
and outputs the same as the
video data S12 to the FIFO memory 160 and the encoder 162
as shown in Fig. 4A.

Note that, due to the rearrangement of the order of
pictures by the encoder control unit 12, the order of
coding of the pictures shown in Figs. 4A to 4C etc. and
the order indicated after expansion and decoding are
different.

The FIFO memory 160 delays the pictures of the input
video data S12 by exactly the amount of L pictures and
outputs the same to the encoder 18.

The encoder 162 preliminarily successively
compresses and codes the pictures of the input video data
S12 and outputs the amount of data of the compressed and
coded data obtained by compressing and coding the j-th (j is
an integer) picture, the value of the DC component of
the video data after the DCT processing, and the power
value of the AC component to the host computer 20.

For example, since the delayed video data S16 input
to the encoder 18 is delayed by exactly the amount of L
pictures by the FIFO memory 160, as shown in Fig. 4B,
when the encoder 18 compresses and codes the j-th (j is an
integer) picture (picture a of Fig. 4B) of the delayed
video data S16, the encoder 162 compresses and codes the
(j+L)-th picture (picture b of Fig. 4B) advanced from the
j-th picture of the video data S12 by the amount of L
pictures. Accordingly, when the encoder 18 starts the
compression and coding of the j-th picture of the delayed
video data S16, the encoder 162 has completed the
compression and coding of the j-th to (j+L-1)-th pictures
(range c of Fig. 4B) of the video data S12, and the real
difficulty data D_j, D_{j+1}, D_{j+2}, ..., D_{j+L-1} after compression
and coding of these pictures have been already calculated
by the host computer 20.

The host computer 20 calculates the target amount of
data T_j to be allocated to the compressed video data
obtained by compressing and coding the j-th picture of
the delayed video data S16 by the encoder 18 by the
following Equation 1 and sets the calculated target
amount of data T_j in the quantization control circuit 180.



Note that, in Equation 1, D_j is the real difficulty data
of the j-th picture of the video data S12, and R'_j is an
average of the target amount of data which can be allocated
to the j-th to the (j+L-1)-th pictures of the video data
S12 and S16. The initial value (R'_1) of R'_j is the
target amount of data which can be allocated to pictures
of the compressed video data on the average, is represented
by the following Equation 2, and is updated as shown in Equation
3 whenever the encoder 18 generates one picture's worth
of the compressed video data.

R'_1 = (Bit_rate × L) / Picture_rate    (2)

Note that, in Equations 2 and 3, the numerical value "Bit_rate"
indicates the amount of data (amount of
bits) per second determined based on the transmission
capacity of the communication line and the recording
capacity of the recording medium, "Picture_rate"
indicates the number of pictures per second contained in
the video data (30 pictures/sec (NTSC), 25 pictures/sec
(PAL)), and the numerical value F_{j+L} indicates the average
amount of data per picture determined in accordance with
the picture type.
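
To make the allocation concrete, a minimal sketch follows. Only Equation 2 is taken from the text as printed. The forms used here for Equations 1 and 3 are assumptions inferred from the definitions of D_j, R'_j, and F_{j+L}: a share of R'_j in proportion to the difficulty of the j-th picture within the window of L pictures, and a sliding update of R'_j that removes the bits actually generated and adds F_{j+L}. The function names and the parameter `bits_generated` are hypothetical.

```python
def initial_budget(bit_rate, lookahead, picture_rate):
    """Equation 2: R'_1 = (Bit_rate x L) / Picture_rate."""
    return bit_rate * lookahead / picture_rate


def target_bits(budget, difficulties):
    """Assumed form of Equation 1.

    `difficulties` holds D_j, D_{j+1}, ..., D_{j+L-1}; the j-th picture
    receives a share of the window budget proportional to D_j.
    """
    return budget * difficulties[0] / sum(difficulties)


def update_budget(budget, bits_generated, f_next):
    """Assumed form of Equation 3.

    The bits actually generated for picture j leave the window budget and
    the average allowance F_{j+L} of the entering picture is added.
    """
    return budget - bits_generated + f_next
```

For example, at 4,000,000 bits per second, 30 pictures per second, and L = 15, the initial budget is 2,000,000 bits. If the first picture in the window is twice as difficult as each of two others in a three-picture window, it would receive half of that window's budget under the assumed Equation 1.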

The DCT circuit 166 of the encoder 18 performs DCT
processing on the j-th picture of the input delayed video
data S16 and outputs the result to the quantization
circuit 168.

The quantization circuit 168 quantizes the data of
the frequency domain of the j-th picture input from the
DCT circuit 166 by the quantization value Q_j adjusted
based on the target amount of data T_j by the quantization
control circuit 180 and outputs the result as the
quantized data to the variable length coding circuit 170.

The variable length coding circuit 170 performs
variable length coding on the quantized data of the j-th
picture input from the quantization circuit 168 so as to
generate the compressed video data VOUT of an amount of
data near the target amount of data T_j and outputs the
same.

Similarly, as shown in Fig. 4C, when the encoder 18
compresses and codes the (j+1)-th picture (picture a' of
Fig. 4C) of the delayed video data S16, the encoder 162
has completed the compression and coding of the (j+1)-th to
(j+L)-th pictures (range c' of Fig. 4C) of the video data
S12. The real difficulty data D_{j+1}, D_{j+2}, D_{j+3}, ..., D_{j+L} of
these pictures have been already calculated by the host
computer 20.

The host computer 20 calculates the target amount of
data T_{j+1} to be allocated to the compressed video data
obtained by compressing and coding the (j+1)-th picture
of the delayed video data S16 by the encoder 18 by
Equation 1 and sets the same in the quantization control
circuit 180 of the encoder 18.

The encoder 18 compresses and codes the (j+1)-th
picture based on the target amount of data T_{j+1} set in the
quantization control circuit 180 from the host computer
20 so as to generate the compressed video data VOUT of an
amount of data near the target amount of data T_{j+1} and
outputs the same.

Thereafter, similarly, the video data
compression apparatus 1 successively compresses and codes
the k-th picture (k = j+2, j+3, ...) of the delayed video
data S16 by changing the quantization value Q_k for
every picture and outputs the result as the compressed
video data VOUT.

As explained above, according to the video data 
compression apparatus 1 shown in the first embodiment, 
the difficulty of the pattern of the noncompressed video 
data VIN can be calculated in a short time and the 
noncompressed video data VIN can be adaptively compressed 
and coded with a compression rate in accordance with the 
calculated difficulty. That is, according to the video 
data compression apparatus 1 shown in the first 
embodiment, the noncompressed video data VIN can be 
adaptively compressed and coded in almost real time based 
on the difficulty of the pattern of the noncompressed 
video data VIN unlike the two pass encoding system, and 
thus the apparatus can be applied to applications such as 
live broadcasts in which a real time property is required. 

Note that, in addition to that shown in the first
embodiment, the video data compression apparatus 1
according to the present invention can adopt various other
configurations. For example, the amount of data of the
compressed video data compressed and coded by the encoder
162 may be used as it is as the difficulty data so as to
simplify the processing of the host computer 20.

Second Embodiment

Below, a second embodiment of the present invention
will be explained.

The simplified two pass encoding system shown in the
first embodiment is an excellent system capable of
performing compression and coding by giving only a delay of
about one GOP (for example 0.5 second) to the input
noncompressed video data and generating compressed video
data of suitable amounts of data.

In this system, however, two encoders are required.
In general, an encoder for compressing and coding video
data requires large hardware and is very expensive even
if formed as an integrated circuit. In addition, it is
large in size. Accordingly, the necessity of two encoders
in this system prevents a reduction of the cost of the
apparatus, a reduction of size, and saving of electric
power. Further, it is desirable that the time delay
required for the compression and coding be as short as
possible, but the processing for calculation of the real
difficulty data and the predictive difficulty data D_j'
and the preliminary compression and coding processing per
se require a few pictures' worth of processing time,
therefore these processings per se become a factor
preventing the shortening of the time delay.

The second embodiment was made so as to solve such a
disadvantage and provides a video data compression method
capable of generating compressed video data of suitable
amounts of data equivalent to the simplified two pass
encoding system by only using one encoder and in addition
requiring a shorter time delay for the processing.

Figure 5 is a schematic view of the configuration of
a video data compression apparatus 2 according to a
second embodiment of the present invention.

Figure 6 is a detailed view of the configuration of
a compression and coding unit 24 of the video data
compression apparatus 2 shown in Fig. 5.

Note that, in Fig. 5 and Fig. 6, among the
constituent parts of the video data compression apparatus
2, the parts the same as the constituent parts of the
video data compression apparatus 1 explained in the first
embodiment (Fig. 1 and Fig. 2) are indicated by the same
symbols or numerals.

As shown in Fig. 5, the video data compression
apparatus 2 adopts a configuration in which the
compression and coding unit 10 of the video data
compression apparatus 1 (Fig. 1 and Fig. 2) is replaced
by a compression and coding unit 24 obtained by excluding
the encoder 162 from the compression and coding unit 10,
replacing the encoder control unit 12 by the encoder
control unit 22, and adding a buffer memory (buffer) 182.

As shown in Fig. 6, the compression and coding unit 
24 is constituted by an image rearranging circuit 220, a
scanning transforming and macroblock forming circuit 222, 
and a statistical amount (parameter) calculation circuit 
224. The rest of the constituent parts of the compression 
and coding unit 24 adopt the same configurations as those 
of the compression and coding unit 10. 

The encoder control unit 22 notifies the existence 
of a picture of the noncompressed video data VIN to the 
host computer 20 similar to the encoder control unit 12 
and further performs the preprocessing for compression 
and coding for every picture of the noncompressed video 
data VIN. 

In the encoder control unit 22, the image 
rearranging circuit 220 rearranges the input 
noncompressed video data in the order of coding. 

The scanning transforming and macroblock forming 
circuit 222 performs the picture field transformation and 
performs the 3:2 pull down processing etc. where the 
source of the noncompressed video data VIN is the video 
data of a movie.

The statistical amount calculation circuit 224
calculates a statistical amount such as the flatness and
intra AC from the pictures which have been processed by the
image rearranging circuit 220 and the scanning transforming and
macroblock forming circuit 222 and which are to be compressed
and coded to I pictures.

The video data compression apparatus 2 uses a
statistical amount (flatness, intra AC) of the
noncompressed video data and the predictive error amount
(ME residual) of motion prediction in place of the
difficulty of the pattern of the noncompressed video data
VIN by these constituent parts, adaptively calculates the
target amount of data T_j similar to the video data
compression apparatus 1 (Fig. 1 and Fig. 2), and performs
high precision feed forward control to compress and code
the noncompressed video data VIN to compressed video data
of suitable amounts of data.

Note that, in the video data compression apparatus
2, the target amount of data T_j is determined based on the
indicator data (parameter) detected in advance by the
motion detector 14 and the statistical amount calculation
circuit 224 of the encoder control unit 22.
Therefore, below, the compression and coding system in the
video data compression apparatus 2 will be referred to as
a feed forward rate control (FFRC) system.

Note that the ME residual is defined as the sum of
the absolute values or the sum of the square values of
the difference of the video data between a picture to be
compressed and a reference picture. It is calculated by
the motion detector 14 from the pictures which become
P pictures and B pictures after compression, represents
the speed of the motion of the image and the complexity
of the pattern, and has a correlation with the difficulty
and the amount of data after compression similar to the
flatness.

The I picture is compressed and coded without 
reference to other pictures, so the ME residual cannot be 
found. Therefore, the flatness and intra AC are used as the 
parameters in place of the ME residual. 

Further, flatness is a parameter newly defined as an
indicator representing the spatial flatness of the image
so as to realize the video data compression apparatus 2,
indicates the complexity of the video, and has a
correlation with the difficulty (degree of difficulty) of
the pattern of the image and the amount of data after the
compression.

Further, the intra AC is a parameter newly defined
as the sum of the dispersion values of the video data for
every DCT block of the DCT processing unit in the MPEG
system so as to realize the video data compression
apparatus 2, indicates the complexity of the image, and
has a correlation with the difficulty of the pattern of
the image and the amount of data after the compression
similar to the flatness.

Below, an explanation will be made of the ME
residual, flatness, and intra AC.

In the simplified two pass encoding system explained
in the first embodiment, the real difficulty data D_j
indicated the difficulty of the pattern of the image and
the target amount of data T_j was calculated based on the
real difficulty data D_j.

In order to make the amount of data of the
compressed video data generated by the encoder 18
approach the value indicated by the target amount of data
T_j, the quantization value is controlled in the
quantization circuit 168 (Fig. 2 and Fig. 6).
Accordingly, if a parameter obtained without compressing
and coding the video data and adequately indicating the
complexity (difficulty) of the video data similar to the
real difficulty data D_j can be obtained before the
quantization processing in the quantization circuit 168
of the encoder 18, it would be possible to achieve the
objectives of eliminating the encoder 162 (Fig. 1) and
shortening the processing delay time. The ME residual,
the flatness, and the intra AC have a strong correlation
with the real difficulty data D_j and therefore are
adequate for achieving these objectives.

Relationship Between ME Residual and Real Difficulty
Data D_j

When performing compression and coding processing by
referring to other pictures to generate the P pictures
and B pictures, the motion detector 14 searches for a
macroblock giving the smallest sum of the absolute values
or sum of the square values of the difference between a
particular macroblock of the picture (input picture) to
be compressed and the picture referred to (reference
picture) so as to find the motion vector. The ME residual
is defined as the value obtained by adding up, over the
entire picture, the sums of absolute values or sums of
the squares of the differences of each of the macroblocks
giving the smallest sums when finding the motion vector
in this way.
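
The definition of the ME residual above can be illustrated by the following minimal sketch. It assumes a full search over a small window and the sum of absolute differences; the function names, the block size, and the search range are hypothetical choices for illustration and are not taken from the motion detector 14.

```python
def sad(cur, ref, cy, cx, ry, rx, block):
    """Sum of absolute differences between two block x block regions."""
    return sum(
        abs(cur[cy + i][cx + k] - ref[ry + i][rx + k])
        for i in range(block)
        for k in range(block)
    )


def me_residual(cur, ref, block=16, search=2):
    """Add up, over all macroblocks of the input picture, the smallest SAD
    found when searching the reference picture for the motion vector.

    cur, ref: pictures given as lists of rows of pixel values.
    block:    macroblock size in pixels (assumed value).
    search:   search range in pixels in each direction (assumed value).
    """
    height, width = len(cur), len(cur[0])
    total = 0
    for y in range(0, height - block + 1, block):
        for x in range(0, width - block + 1, block):
            candidates = []
            for dy in range(-search, search + 1):
                for dx in range(-search, search + 1):
                    ry, rx = y + dy, x + dx
                    # Skip candidates that fall outside the reference picture.
                    if 0 <= ry <= height - block and 0 <= rx <= width - block:
                        candidates.append(sad(cur, ref, y, x, ry, rx, block))
            # The zero displacement is always valid, so candidates is never empty.
            total += min(candidates)
    return total
```

A still picture gives an ME residual of zero, while fast motion or a complex pattern leaves a large residual even at the best matching position.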

Figure 7 is a view of the correlation between the ME
residual and the real difficulty data D_j when generating
a P picture by the video data compression apparatuses 1
and 2.

Figure 8 is a view of the correlation between the ME
residual and the real difficulty data D_j when generating
a B picture by the video data compression apparatuses 1
and 2.

Note that, in Fig. 7 and Fig. 8, as the real
difficulty data D_j, use is made of the amount of data of
the compressed video data obtained by compression and
coding by using a fixed quantization value by the encoder
18 (below, the same in Fig. 10 and Fig. 11), and Fig. 7
and Fig. 8 are graphs of the relationship between the ME
residual and the real difficulty data D_j obtained where
standard images ["cheer" (cheer leaders), "mobile"
(mobile and calendar), "tennis" (table tennis), and "diva"
(diva with noise)] standardized by the CCIR and other
images ("resort") are actually compressed and coded by
the MPEG system. In Fig. 7 and Fig. 8, the ordinate of the
graph (difficulty) indicates the real difficulty data D_j,
and the abscissa (me resid) indicates the ME residual.

As understood when referring to Fig. 7 and Fig. 8,
the ME residual has a strong correlation with the real
difficulty data D_j. Accordingly, in place of the real
difficulty data D_j of a picture which becomes a P picture
or a B picture after compression, the ME residual can be
used for the generation of the target amount of data T_j.

Relationship Between Flatness and Real Difficulty
Data D_j

Figure 9 is a view of the method of calculation of 
the flatness. 

The flatness is calculated by dividing each of the
DCT blocks which serve as the units of DCT processing in
the MPEG system into small blocks each consisting of 2
pixels x 2 pixels as shown in Fig. 9, calculating the
difference of the data of the diagonal pixels (pixel
values) in these small blocks, comparing the difference
with a predetermined threshold value, and further finding
the total number of small blocks in which the difference
becomes smaller than the threshold value for every picture.

Note that the value of the flatness becomes smaller
as the pattern of the video becomes spatially more
complex and becomes larger when it is flatter.
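
The flatness calculation can be sketched as follows. This is an illustration under stated assumptions: the text does not say whether one or both diagonal pairs of a 2 x 2 small block are compared, so the sketch assumes both differences must be smaller than the threshold, and it assumes the picture dimensions are multiples of 8 so that stepping over the whole picture in 2 x 2 units visits the same small blocks as dividing each 8 x 8 DCT block. The function name is hypothetical.

```python
def flatness(picture, threshold):
    """Count the 2 x 2 small blocks of a picture whose diagonal pixel
    differences are smaller than the threshold.

    picture:   list of rows of pixel values.
    threshold: predetermined threshold value (its value is not given in
               the text and must be chosen by the user).
    """
    height, width = len(picture), len(picture[0])
    count = 0
    for y in range(0, height - 1, 2):
        for x in range(0, width - 1, 2):
            # Difference along the main diagonal of the small block.
            d1 = abs(picture[y][x] - picture[y + 1][x + 1])
            # Difference along the other diagonal (assumed to be tested too).
            d2 = abs(picture[y][x + 1] - picture[y + 1][x])
            if d1 < threshold and d2 < threshold:
                count += 1
    return count
```

A uniformly gray 8 x 8 picture gives the maximum count of 16, while a picture of alternating one-pixel vertical stripes gives a count of 0.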

Figure 10 is a view of the correlation between the
flatness and the real difficulty data D_j when generating
an I picture by the video data compression apparatuses 1
and 2.

Note that Fig. 10 is a graph of the relationship
between the flatness and the real difficulty data D_j
obtained when the standard images standardized by the CCIR
and other images are actually compressed and coded by the
MPEG system. In Fig. 10, the ordinate of the graph
(difficulty) indicates the real difficulty data D_j, and the
abscissa (flatness) indicates the flatness.

As shown in Fig. 10, there is a strong negative
correlation between the flatness and the real difficulty
data D_j, and it is seen that the real difficulty data D_j
can be approximated by a method such as substituting the
flatness into a primary (linear) function.

Relationship Between Intra AC and Real Difficulty
Data D_j

The intra AC is calculated as the sum of the
absolute values of the difference between the pixel value
of each pixel in a DCT block (8x8 pixels) and the average
value (\bar{P}) of the pixel values (P_k) in the DCT block for
every DCT block. That is, the intra AC can be found by
the following Equation 4.

Intra\_AC = \sum_{picture} \sum_{k=1}^{64} | P_k - \bar{P} |     (4)
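
Equation 4 can be written out directly as the following short sketch. The function name is hypothetical, and the picture dimensions are assumed to be multiples of the DCT block size.

```python
def intra_ac(picture, block=8):
    """Sum, over every DCT block of the picture, of the absolute
    differences between each pixel value and the block average.

    picture: list of rows of pixel values.
    block:   DCT block size (8 in the MPEG system).
    """
    height, width = len(picture), len(picture[0])
    total = 0.0
    for y in range(0, height - block + 1, block):
        for x in range(0, width - block + 1, block):
            pixels = [
                picture[y + i][x + k]
                for i in range(block)
                for k in range(block)
            ]
            mean = sum(pixels) / len(pixels)  # average value of the block
            total += sum(abs(p - mean) for p in pixels)
    return total
```

A flat block contributes nothing to the intra AC, so the value grows with the spatial complexity of the picture, which is the opposite tendency to the flatness.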

Figure 11 is a view of the correlation between the
intra AC and the real difficulty data D_j when generating an I
picture by the video data compression apparatuses 1 and
2.

Note that Fig. 11 is a graph showing the
relationship between the intra AC and the real difficulty
data D_j obtained where the standard images standardized
by the CCIR and other images are actually compressed and
coded by the MPEG system. In Fig. 11, the ordinate of the
graph (difficulty) indicates the real difficulty data D_j,
and the abscissa (intra AC) indicates the intra AC.

As explained heretofore, it is seen that the real
difficulty data D_j can be approximated by a primary
function etc. of each indicator data (statistical amount).
Accordingly, it is possible to calculate the real
difficulty data D_j of each picture type as follows.

The real difficulty data D_j is approximated by the
ME residual (ME_j) by the following Equation 5 for a P
picture and by the following Equation 6 for a B picture.
Further, the real difficulty data D_j is approximated by
the flatness (FR_j) and intra AC (Intra_AC_j) or either of
them by approximation equations (Equations 7 and 8) similar
to Equations 5 and 6 for an I picture.



D_j = a_P \times ME_j + b_P     (5)

D_j = a_B \times ME_j + b_B     (6)

D_j = a_I \times FR_j + b_I     (7)

D_j = a_I' \times Intra\_AC_j + b_I'     (8)

Here, a_P, a_B, a_I, and a_I' are proportional
coefficients.
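
The approximation by a primary function can be illustrated as follows. The text does not state how the coefficients are obtained, so this sketch assumes, purely for illustration, that they are fitted by ordinary least squares to previously measured pairs of indicator data and real difficulty data such as those plotted in Fig. 7, Fig. 8, Fig. 10, and Fig. 11. The function names and the sample values are hypothetical.

```python
def fit_line(indicators, difficulties):
    """Least-squares fit of difficulty = a * indicator + b.

    Returns the pair (a, b). Assumes at least two distinct indicator values.
    """
    n = len(indicators)
    mean_x = sum(indicators) / n
    mean_y = sum(difficulties) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(indicators, difficulties))
    sxx = sum((x - mean_x) ** 2 for x in indicators)
    a = sxy / sxx
    b = mean_y - a * mean_x
    return a, b


def linear_difficulty(indicator, a, b):
    """Equations 5 to 8: approximate D_j from one indicator value."""
    return a * indicator + b


# Hypothetical measurements for P pictures: (ME residual, real difficulty).
a_p, b_p = fit_line([1.0, 2.0, 3.0], [5.0, 7.0, 9.0])
approx = linear_difficulty(4.0, a_p, b_p)
```

For the flatness, the fitted slope would be negative, reflecting the negative correlation shown in Fig. 10.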

Further, in the same way as in the simplified two pass
encoding system shown in the first embodiment, the target
amount of data T_j is calculated by substituting the real
difficulty data D_j obtained by these approximations into
Equation 1.

Below, the operation of the video data compression
apparatus 2 will be explained by taking as an example a
case where the real difficulty data D_j is approximated by
the ME residual, the flatness, and the intra AC and the
noncompressed video data is compressed and coded by the
simplified two pass encoding system.

In the encoder control unit 22, the image
rearranging circuit 220 rearranges the noncompressed
video data VIN in pictures in the order of coding. The
scanning transforming and macroblock forming circuit
222 performs the picture field transformation etc. The
statistical amount calculation circuit 224 performs the
computation processing shown in Fig. 9 and Equation 4
with respect to a picture to be compressed and coded to
an I picture and calculates a statistical amount such as
the flatness and intra AC.

The motion detector 14 generates motion vectors for
the pictures to be compressed and coded to a P picture
and B picture and further calculates the ME residual.

The FIFO memory 160 delays the input video data by 
exactly an amount of L pictures. 

The host computer 20 performs the computation
processing shown in Equation 5 and Equation 6 with respect
to the ME residual generated by the motion detector 14 to
approximate the real difficulty data D_j and performs the
computation processing shown in Equation 7 and Equation
8 to approximate the real difficulty data D_j by the
flatness and the intra AC.

Further, the host computer 20 substitutes the
approximated real difficulty data D_j into Equation 1 to
calculate the target amount of data T_j and sets the
calculated target amount of data T_j in the quantization
control circuit 180 of the encoder 18.

The DCT circuit 166 of the encoder 18 performs DCT
processing on the j-th picture of the delayed video data.

The quantization circuit 168 quantizes the data of
the frequency domain of the j-th picture input from the
DCT circuit 166 by the quantization value adjusted by
the quantization control circuit 180 based on the target
amount of data T_j.

The variable length coding circuit 170 performs
variable length coding on the quantized data of the j-th
picture input from the quantization circuit 168, generates
the compressed video data VOUT of an amount of data near
the target amount of data T_j, and outputs the same via
the buffer memory 182 to an external unit.

Note that, in the TM5 system or the like known as
a compression algorithm of the MPEG, to calculate the
quantization value of the macroblock (MQUANT), a
statistical amount such as the "activity" shown in the
following Equation 9 is used. The activity has a strong
correlation with the real difficulty data D_j similar to the
flatness and the intra AC, therefore it is also possible
to constitute the video data compression apparatus 2 so
as to perform the compression and coding by approximating
the real difficulty data D_j using the activity in place
of these parameters.

act_j = 1 + \min_{sblk = 1, \dots, 8} ( var\_sblk )     (9)

Where, act_j: j-th activity
sblk: sub block
var: dispersion.
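
The activity of Equation 9 can be sketched as follows. The text only names the sub block and the dispersion, so the layout of the eight sub blocks used here (four frame-organized and four field-organized 8 x 8 luminance blocks of a 16 x 16 macroblock) is an assumption based on the usual description of TM5, and the function names are hypothetical.

```python
def variance(pixels):
    """Dispersion (variance) of a list of pixel values."""
    mean = sum(pixels) / len(pixels)
    return sum((p - mean) ** 2 for p in pixels) / len(pixels)


def sub_blocks(mb):
    """Yield the eight 8 x 8 sub blocks of a 16 x 16 luminance macroblock."""
    # Four frame-organized sub blocks: the four quadrants.
    for y0 in (0, 8):
        for x0 in (0, 8):
            yield [mb[y0 + i][x0 + k] for i in range(8) for k in range(8)]
    # Four field-organized sub blocks: even lines and odd lines,
    # each split into a left half and a right half.
    for parity in (0, 1):
        for x0 in (0, 8):
            yield [mb[parity + 2 * i][x0 + k] for i in range(8) for k in range(8)]


def activity(mb):
    """Equation 9: one plus the smallest sub block dispersion."""
    return 1 + min(variance(sb) for sb in sub_blocks(mb))
```

Taking the minimum means that a macroblock is treated as low in activity as soon as any one of its sub blocks, in frame or field organization, is flat.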

Further, while the operation of the video data
compression apparatus 2 was explained by taking as an
example a case where the simplified two pass encoding
shown in the first embodiment was carried out, needless
to say the video data compression apparatus 2 can perform
the predictive simplified two pass encoding as well.

Further, modifications similar to those with respect 
to the video data compression apparatus 1 shown in the 
first embodiment are also possible with respect to the 
video data compression apparatus 2 shown in the second 
embodiment.

Third Embodiment 

Below, a third embodiment of the present invention 
will be explained. 

In the FFRC system shown in the second embodiment,
the real difficulty data D_j was approximated by
substituting the statistically found indicator data
(statistical amount), that is, the ME residual, flatness,
intra AC, and activity into a primary function such as
Equation 5 and Equation 6.

These indicator data and the real difficulty data D_j
have a strong correlation as shown in Fig. 7, Fig. 8,
Fig. 10, and Fig. 11, but a slight difference occurs from
the above primary function according to the pattern of
the video data.

The processing of the video data compression
apparatus 2 in the third embodiment is designed so as to
solve such a disadvantage and is enhanced so that the
proportional coefficients a_P, a_B, a_I, a_I', etc. shown in
Equation 5 and Equation 6 etc. are adaptively adjusted
every moment in accordance with the pattern etc. of the
video data so that the real difficulty data D_j can be
approximated by the indicator data with a higher
precision than that in the second embodiment and
compressed video data having a higher quality can be
produced.

Below, the processing of the video data compression 
apparatus 2 in the third embodiment will be briefly 
explained. 

Whenever the encoder 18 of the video data
compression apparatus 2 (Fig. 6) ends one picture's worth
of compression and coding, the host computer 20 learns one
picture's worth of the amount of data of the generated
compressed video data and the average value of the
quantization values Q_j at the time of the compression and
coding, so that the global complexity explained below can be
calculated.

The global complexity is defined as shown in the
following Equation 10-1 to Equation 10-3 as a value
obtained by multiplying the amount of data of the
compressed video data and the quantization value Q_j in
TM5 of the MPEG and indicates the complexity of the
pattern of the image.

X_I = S_I \times Q_I     (10-1)
X_P = S_P \times Q_P     (10-2)
X_B = S_B \times Q_B     (10-3)

Note that, in Equation 10-1 to Equation 10-3, S_I, S_B,
and S_P indicate the generated amounts of data of the
coded I picture, B picture, and P picture, Q_I, Q_B, and Q_P
indicate the average values of the quantization values
when producing the I picture, B picture, and P picture,
and X_I, X_B, and X_P indicate the global complexity of the I
picture, B picture, and P picture, respectively.

The global complexities shown in Equation 10-1 to
Equation 10-3 do not always coincide with the real
difficulty data D_j, but substantially coincide with the
real difficulty data D_j so far as the average value of
the quantization values is not extremely large or small.

Here, when the indicator data of the I picture, P
picture, and the B picture, for example the intra AC
(other parameter is also possible) and ME residual, and the
global complexity have a proportional relationship, the
proportional coefficients ε_I, ε_P, and ε_B between these
indicator data and global complexity can be calculated by
following Equation 11-1 to Equation 11-3.

Case of I picture (11-1)

ε_I = \frac{X_I}{Intra\_AC_j}

Case of P picture (11-2)

ε_P = \frac{X_P}{ME\_resid_j}

Case of B picture (11-3)

ε_B = \frac{X_B}{ME\_resid_j}

The real difficulty data D_j of each picture type is
calculated as shown in following Equation 12-1 to
Equation 12-3 by using the proportional coefficients ε_I,
ε_P, and ε_B calculated by Equation 11-1 to Equation 11-3.

Case of I picture (12-1)

D_j = ε_I \times Intra\_AC_j

Case of P picture (12-2)

D_j = ε_P \times ME\_resid_j

Case of B picture (12-3)

D_j = ε_B \times ME\_resid_j

The host computer 20 calculates and optimizes the
proportional coefficients ε_I, ε_P, and ε_B whenever the
encoder 18 compresses and codes one picture as shown in
Equation 11-1 to Equation 11-3 and finds the values of
the real difficulty data D_j of the picture types by Equation
12-1 to Equation 12-3, whereby the real difficulty data D_j
can be always optimally approximated by the indicator
data irrespective of the pattern of the video data.
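
Equations 10 to 12 can be illustrated by the following minimal sketch. The function names are hypothetical, and the handling of an indicator value of zero (keeping the previous coefficient) is an assumption added for robustness that does not appear in the text.

```python
def global_complexity(generated_bits, average_quant):
    """Equation 10: X = S x Q for the picture just coded."""
    return generated_bits * average_quant


def update_coefficient(complexity, indicator, previous):
    """Equation 11: proportional coefficient = X / indicator.

    Keeps the previous coefficient when the indicator is zero
    (assumed behavior, to avoid division by zero).
    """
    if indicator == 0:
        return previous
    return complexity / indicator


def difficulty_from_indicator(coefficient, indicator):
    """Equation 12: D_j = coefficient x indicator."""
    return coefficient * indicator


# Hypothetical P picture: 40000 bits generated at an average
# quantization value of 10, with an ME residual of 200000.
x_p = global_complexity(40000, 10.0)
eps_p = update_coefficient(x_p, 200000.0, previous=1.0)
# The updated coefficient is applied to the next P picture.
next_difficulty = difficulty_from_indicator(eps_p, 150000.0)
```

Because each coefficient is recomputed after every coded picture of its type, the approximation follows gradual changes in the pattern of the video data.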

The host computer 20 performs the computation
processing shown in Equation 1 with respect to the real
difficulty data D_j approximated as shown in Equation 11
and Equation 12 to calculate the target amount of data T_j.

Note that, as in TM5 of the MPEG, when the value of
the target amount of data T_j to be actually calculated is
intentionally changed by a constant ratio with respect to
the value determined based on the real difficulty data
D_j, the target amount of data T_j can be calculated by the
following Equation 13-1 to Equation 13-3:

Case of I picture (13-1) 

1 Tm xR 



P _pictnrt Bj>ictnrc 



10 



Case of P picture (13-2) 



w xRj 

r jtlcnre Bjianre 



Case of B picture (13-3)



T = ^ 

P _picture B_pictrire 



Note that, in all denominators of Equation 13-1 to
Equation 13-3, D indicates the real difficulty data
approximated by the indicator data produced from L
pictures' worth of the noncompressed video data buffered
in the FIFO memory 160 before input to the encoder 18,
and R_j indicates the average value of the amounts of data
which can be assigned to L pictures after the j-th picture.

Below, the operation of the video data compression
apparatus 2 according to the third embodiment will be
explained.

Figure 12 is a view of the compression and coding
operation of the video data compression apparatus 2 (Fig.
6) in the third embodiment.

The encoder control unit 22 rearranges the pictures
of the noncompressed video data VIN in the order of
coding similar to the second embodiment, performs the
picture field transformation, etc. and calculates the
statistical amount such as flatness and intra AC from the
(j+N)-th picture to be compressed and coded to the I picture.

The motion detector 14 produces a motion vector for
the (j+N)-th picture to be compressed and coded to the P
picture or B picture similar to that in the first
embodiment and second embodiment and further calculates
the ME residual (Fig. 12A).

The FIFO memory 160 delays the input video data by
exactly the amount of L pictures similar to the first
embodiment and the second embodiment.

The host computer 20 performs the computation
processing shown in Equation 12-2 and Equation 12-3 with
respect to the ME residual produced by the motion detector
14 to approximate the real difficulty data D_j, and
performs the computation processing shown in Equation 12-1
to approximate the real difficulty data D_j by the intra AC,
etc. (Fig. 12B).

Further, the host computer 20 substitutes the
approximated real difficulty data D_j into Equation 1 or
Equations 13-1 to 13-3 to calculate the target amount of
data T_j and sets the same in the quantization control
circuit 180 of the encoder 18 (Fig. 12C).

The DCT circuit 166 of the encoder 18 performs the 
DCT processing for the j-th picture of the delayed video 
data similar to that in the first embodiment and the 
second embodiment. 

The quantization circuit 168 quantizes the data of
the frequency domain of the j-th picture input from the
DCT circuit 166 by the quantization value Q_j adjusted
based on the target amount of data T_j by the quantization
control circuit 180 and, at the same time, calculates the
average value of the quantization values Q_j used for the
compression and coding of the j-th picture and outputs
the same to the host computer 20.

The variable length coding circuit 170 performs
variable length coding on the quantized data of the j-th
picture input from the quantization circuit 168 similar to
that in the first embodiment and second embodiment to
produce the compressed video data VOUT of an amount of
data near the target amount of data T_j and outputs the
same via the buffer memory 182.

When the encoder 18 terminates the compression and
coding of the j-th picture, the host computer 20
calculates the global complexity as shown in Equation 10-1
to Equation 10-3 based on the average value of the
quantization values Q_j with respect to the j-th picture
input from the quantization control circuit 180 and the
amount of data of the compressed and coded j-th picture
(Fig. 12D).

Further, the host computer 20 updates the
proportional coefficients ε_I, ε_P, and ε_B as shown in
Equation 11-1 to Equation 11-3 by the calculated global
complexity (Fig. 12E). The updated proportional
coefficients ε_I, ε_P, and ε_B are reflected in the
transformation equations at the time of compression and
coding of the next picture (Equation 12-1 to Equation
12-3).

The content of the processing of the host computer 
20 in the third embodiment will be further explained by 
referring to Fig. 13. 

Figure 13 is a view of the content of the processing
of the host computer 20 (Fig. 6) of the video data
compression apparatus 2 in the third embodiment.

As shown in Fig. 13, at step 300 (S300), the host
computer 20 fetches the (j+l)-th indicator data
(statistical amount) such as the ME residual or intra AC from
the encoder control unit 22 or the motion detector 14.

At step 302 (S302), the host computer 20 decides to 
which picture type the (j+l)-th picture is to be 
compressed and coded. When the (j+l)-th picture is to be 
compressed and coded to an I picture, the operation 
routine proceeds to the processing of S304; when it is to 
be compressed and coded to a P picture, the operation 
routine proceeds to the processing of S306; and when it 
is to be compressed and coded to a B picture, the 
operation routine proceeds to the processing of S308. 

In each of step 304 (S304), step 306 (S306), and
step 308 (S308), the host computer 20 approximates the
real difficulty data D_j by Equation 12-1 to Equation 12-3.

At step 310 (S310), the host computer 20 calculates
the target amount of data T_j by Equation 1 or Equation
13-1 to Equation 13-3 by using the approximated real
difficulty data D_j.

At step 312 (S312), the encoder 18 compresses and 
codes the j-th picture. 

At step 314 (S314), the host computer 20 calculates
the global complexities X_I, X_B, and X_P [X (I, B, P)] from
the amount of data of the j-th picture compressed by the
encoder 18 and the average value of the quantization
values Q_j set in the quantization circuit 168 by the
quantization control circuit 180.

At step 316 (S316), the host computer 20 decides to
which picture type the (j+l)-th picture is to be
compressed and coded. Where the (j+l)-th picture is to be 
compressed and coded to an I picture, the operation 
routine proceeds to the processing of S318; when it is to 
be compressed and coded to a P picture, the operation 
routine proceeds to the processing of S320; and when it 
is to be compressed and coded to a B picture, the 
operation routine proceeds to the processing of S322. 

In each of step 318 (S318), step 320 (S320), and
step 322 (S322), the host computer 20 updates the
proportional coefficients ε_I, ε_P, and ε_B by Equation 11-1 to
Equation 11-3.

At step 324 (S324), the host computer 20 increments
the numerical value j.
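
The flow of Fig. 13 can be sketched as the following loop. This is a simplified illustration under several assumptions: it ignores the look-ahead between the picture whose indicator data is fetched and the picture being coded, it takes the target calculation (Equation 1 or Equations 13-1 to 13-3) and the encoder as caller-supplied functions, and all names are hypothetical.

```python
def run_rate_control(pictures, encode, target_from_difficulty, initial_eps):
    """Feed forward rate control loop following steps S300 to S324.

    pictures: iterable of (picture_type, indicator) with type "I", "P", or "B".
    encode:   function (picture_type, indicator, target) -> (bits, average_quant),
              standing in for the encoder 18.
    target_from_difficulty: function D_j -> T_j, standing in for Equation 1.
    initial_eps: dict of starting proportional coefficients per picture type.
    """
    eps = dict(initial_eps)
    log = []
    for picture_type, indicator in pictures:               # S300, S302
        difficulty = eps[picture_type] * indicator         # S304-S308 (Eq. 12)
        target = target_from_difficulty(difficulty)        # S310
        bits, average_quant = encode(picture_type, indicator, target)  # S312
        complexity = bits * average_quant                  # S314 (Eq. 10)
        if indicator != 0:
            eps[picture_type] = complexity / indicator     # S316-S322 (Eq. 11)
        log.append((picture_type, difficulty, target, bits))
        # S324: the loop variable advances to the next picture.
    return eps, log


# Toy encoder model (purely hypothetical): the true complexity is
# proportional to the indicator, and the encoder hits the target exactly.
TRUE_K = {"I": 3.0, "P": 2.0, "B": 1.5}


def toy_encode(picture_type, indicator, target):
    complexity = TRUE_K[picture_type] * indicator
    average_quant = complexity / target
    return complexity / average_quant, average_quant
```

With the toy encoder, the coefficients converge to the underlying ratios after one picture of each type, so the second P picture is already estimated from an adapted coefficient.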

Note that, as mentioned in the second embodiment,
there is a case where an offset exists between the
real difficulty data D_j and the multiplied value of the
proportional coefficients ε_I, ε_P, and ε_B and the indicator
data. In such a case, by dividing the values obtained by
subtracting the respective offset values for the I picture,
P picture, and B picture from the
global complexities X_I, X_P, and X_B by the indicator data,
the proportional coefficients ε_I, ε_P, and ε_B can be
calculated.
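
The offset-corrected update can be written as the following short sketch. The function names are hypothetical, and the second function assumes that the offset is added back when the real difficulty data is approximated, which the text implies but does not state as an equation.

```python
def update_coefficient_with_offset(complexity, offset, indicator):
    """Proportional coefficient = (X - offset) / indicator."""
    return (complexity - offset) / indicator


def difficulty_with_offset(coefficient, indicator, offset):
    """Assumed counterpart of Equation 12: D_j = coefficient x indicator + offset."""
    return coefficient * indicator + offset


# Hypothetical values: global complexity 5000, offset 1000, indicator 2000.
eps = update_coefficient_with_offset(5000.0, 1000.0, 2000.0)
restored = difficulty_with_offset(eps, 2000.0, 1000.0)
```

Applying the two functions in sequence to the same indicator value returns the original global complexity, which shows that the pair is self-consistent.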

Further, modifications similar to those shown in the
second embodiment are also possible for the operation of
the video data compression apparatus 2 shown in the third
embodiment.

As mentioned above, according to the operation of
the video data compression apparatus 2 in the third
embodiment, the same effect as that by the operation of
the video data compression apparatus 2 shown in the second
embodiment is obtained and, in addition, a more accurate
target amount of data T_j than that in the second
embodiment can be calculated, and as a result, the
quality of the compressed video data can be improved.

Fourth Embodiment

Below, a fourth embodiment of the present invention 
will be explained. 

Up to here, the explanation was made of the feed
forward rate control (FFRC) system for achieving both an
improvement of quality of the compressed video data and a
real time property of the compression and coding
processing by using the indicator data (statistical
amount), that is, flatness, intra AC, activity, and ME
residual, as the second embodiment and the third
embodiment.

In the fourth embodiment, an explanation will be
made of a video data compression method (improved FFRC
method) using a video data compression apparatus 2 (Fig.
16 and Fig. 17) with which the encoder for obtaining the
real difficulty data D_j is unnecessary and in addition
the quality of the compressed video data of the border
(scene change) part of the video data (scene) contained
in the edited video data is not lowered.

It can be considered to perform the compression and 
coding by detecting a part where the real difficulty data 
largely changes along with time as a scene change part 
and changing the picture type sequence. The detection of 
such a scene change is possible also in the FFRC method 
by monitoring the change of the real difficulty data 
approximated by the indicator data in place of the real 
difficulty data along with the elapse of time. 

However, to determine the existence of a scene
change, it is necessary to monitor the change of the
indicator data within a range of about 1 GOP in time before
and after the scene change part. In the video data
compression apparatus 2, the detection of the scene
change part becomes possible only after a considerable
time has elapsed after the motion detector 14 calculates
the indicator data. In actuality, there also exists a
possibility that the detection of the scene change part
will become possible only immediately before the
compression and coding processing in the encoder 18.

Accordingly, the host computer 20 must substantially 
terminate the processing for approximating the real 
difficulty data by the indicator data (Equations 5 to 8 
shown in the second embodiment and Equation 12-1 to 
Equation 12-3 shown in the third embodiment) before the 
detection of the scene change in order to ensure 
sufficient processing time. 

The video data compression apparatus 2 in the fourth 
embodiment provisionally performs the processing for 
approximation of the real difficulty data by the 
indicator data or the global complexity in a state where 
the result of detection of the scene change is not 
determined, corrects only the part requiring a change along 
with a scene change in the provisionally calculated real 
difficulty data after the existence of a scene change 
and the existence of a change of the picture type sequence 
are determined, and thereby performs the processing for 
calculating the target amount of data T_j.

Below, the content of the compression and coding
processing of the video data compression apparatus 2 in
the fourth embodiment will be explained by taking as an
example a case where the picture type sequence with
respect to N pictures is finally determined whenever the
calculation of the ME residuals of N pictures is carried
out [for simplification of the explanation, it is assumed
below that, for example, N = L (L is the number of pictures
corresponding to the delay time of the FIFO memory 160)].
Note that the N pictures used for the determination of the
picture type sequence are the processing units of the
processing for determining the picture type sequence and
do not always have to coincide with the picture type
sequence in the encoder 18. Further, unlike in the usual
GOP, the head picture does not have to be an I picture.
Below, one set of such N pictures will also be
referred to as a rate control GOP (RCGOP).

Figure 14 is a view of the compression and coding 
operation of the video data compression apparatus 2 in 
the fourth embodiment (Fig. 5) in the order of coding. 

The motion detector 14 generates a motion vector for 
the (j+N)-th picture to be compressed and coded to a P 
picture and a B picture similar to that in the first 
embodiment to the third embodiment and further calculates 
the ME residual. 

The encoder control unit 22 performs the
preprocessing of the rearrangement etc. of pictures
similar to that in the second embodiment and the third
embodiment and further calculates the indicator data such
as the flatness, intra AC, and activity.

The FIFO memory 160 delays the input video data by 
exactly an amount of L pictures similar to that in the 
first embodiment to the third embodiment. 

Whenever the video data compression apparatus 2
(Fig. 5) ends one picture's worth of compression and
coding, the host computer 20, similar to that in the second
embodiment and the third embodiment, receives as input
the flatness, intra AC, and activity calculated by the
encoder control unit 22 and the ME residual (statistical
amount) calculated by the motion detector 14. The host
computer 20 stores these indicator data (Fig. 14A).
Further, assuming that no scene change occurs and no
change will occur in the picture type sequence, the host
computer 20 approximates and predicts the real
difficulty data by Equation 12-1 to Equation 12-3 by using
the optimized proportional coefficients ε_I, ε_P, and ε_B
(Equation 11-1 to Equation 11-3 shown in the third
embodiment) similar to that in the third embodiment (Fig.
14B).

Specifically, assuming that the N-th picture from
the I picture of the first RCGOP is compressed and coded
to an I picture, the pictures of the order of whole
multiples of M (n x M) are compressed and coded to P
pictures, and the pictures other than them are compressed
and coded to B pictures, the host computer 20 substitutes
the indicator data produced by the pictures to be
respectively compressed and coded to an I picture, P
picture, and B picture and the proportional coefficients
ε_I, ε_P, and ε_B into Equation 12-1 to Equation 12-3 to
approximate and calculate the real difficulty data D_j.
Note that M indicates the interval of the P pictures in the
encoder 18 where there is no scene change.

That is, the host computer 20 counts the number of
pictures by using the I picture of the previous RCGOP
(first RCGOP: RCGOP #1) as a reference, assumes which
picture type the encoder 18 compresses and codes each
picture of the second RCGOP (RCGOP #2) to, and
approximates and predicts the value of the real difficulty
data D_j by the indicator data in accordance with the
assumed picture type as shown in Equation 12-1 to
Equation 12-3.
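
The provisional picture type assumption described above can be
illustrated by the following minimal sketch. The function name
assumed_picture_type and the example values N = 15 and M = 3 are
hypothetical and are used only for illustration; the count is
taken from the I picture of the previous RCGOP, as in the text.

```python
def assumed_picture_type(count_from_prev_i: int, n: int, m: int) -> str:
    """Provisional picture type when no scene change is assumed.

    count_from_prev_i: number of pictures counted from the I picture
    of the previous RCGOP (1 for the picture right after it).
    n: number of pictures in one RCGOP (N in the text).
    m: interval of the P pictures (M in the text).
    """
    if count_from_prev_i == n:
        return "I"  # the N-th picture is assumed to become the next I picture
    if count_from_prev_i % m == 0:
        return "P"  # whole multiples of M are assumed to become P pictures
    return "B"      # all remaining pictures are assumed to become B pictures


# Illustrative values only: N = 15 pictures per RCGOP, M = 3.
provisional_types = "".join(assumed_picture_type(k, 15, 3) for k in range(1, 16))
print(provisional_types)
```

Each provisional type then selects which of Equation 12-1 to
Equation 12-3 is applied to the stored indicator data of that
picture.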

Note that the probability of existence of the scene
change part in the RCGOP is considered to be relatively
small, therefore the host computer 20 will calculate the
target amount of data with respect to most RCGOPs based
on the predicted real difficulty data (Fig. 14F).

Further, the real difficulty data is only used in
the calculation of the denominator of Equation 1 (first
embodiment) or Equation 13-1 to Equation 13-3 (third
embodiment). As will be explained later, the host
computer 20 performs correction at a stage where the
existence of a change of the picture type sequence is
determined, therefore the value of the target amount of
data T_j can always be correctly calculated.

When the calculation of the real difficulty data
of each picture of the second RCGOP (RCGOP #2) is
terminated, the host computer 20 can detect a scene
change in the second RCGOP by using the calculated real
difficulty data or indicator data. For example, the
host computer 20 monitors the change of the value of the
real difficulty data along with the elapse of time and
determines that a scene change has occurred at a P
picture by detecting a P picture in which the value of
the real difficulty data is, for example, 1.5 times
(in practice, preferably a value within a range of from
1.4 to 1.8 times) the real difficulty data of the P
picture immediately before it. Where a scene change is
detected, the host computer 20 further controls the
encoder 18 and changes the picture type sequence used when
compressing and coding the parts before and after the
scene change of the edited video data so as to change the
first P picture of the following scene to an I picture not
referring to the last picture of the previous scene and
change the last I picture of the previous scene to a P
picture, as shown in Figs. 15B and 15C.
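
The ratio test described above can be sketched as follows. This
is a minimal illustration, assuming that the test is applied as
"at or above the threshold"; the function name find_scene_change
and the sample difficulty values are hypothetical.

```python
from typing import Optional, Sequence


def find_scene_change(p_difficulties: Sequence[float],
                      ratio: float = 1.5) -> Optional[int]:
    """Return the index of the first P picture whose real difficulty
    data is at least `ratio` times that of the P picture immediately
    before it, or None if no such P picture exists.

    p_difficulties holds the real difficulty data of consecutive
    P pictures in time order. The default ratio of 1.5 follows the
    example in the text (1.4 to 1.8 is given as the preferable range).
    """
    for i in range(1, len(p_difficulties)):
        previous, current = p_difficulties[i - 1], p_difficulties[i]
        if previous > 0 and current >= ratio * previous:
            return i
    return None


# Illustrative values only: the fourth P picture jumps from 98 to 170.
scene_change_index = find_scene_change([100.0, 105.0, 98.0, 170.0, 160.0])
no_change_index = find_scene_change([100.0, 110.0, 120.0])
```

In this sketch the returned index identifies the P picture that
would be changed to an I picture.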

By such processing of the host computer 20, the
existence of a change of the picture type sequence is
determined, and the picture type to which each picture is
to be compressed and coded is decided (Fig. 14C).

The host computer 20 corrects the value of the real
difficulty data for the second RCGOP based on the
stored indicator data and the picture type after the
change where there is a change in the picture type
sequence, calculates the correct real difficulty data D_j
(Fig. 14D), calculates the target amount of data T_{N+1}
(target bit) of the (N+1)-th picture in accordance with
each picture type by using Equation 1 and Equation 13-1
to Equation 13-3 (Fig. 14E), and sets the same in the
quantization control circuit 180 of the encoder 18.

Specifically, as shown in Fig. 15C, the host
computer 20 substitutes the indicator data of the picture
changed so as to become not the P picture, but the I
picture after the compression into Equation 12-1 in place
of Equation 12-2, and conversely, substitutes the
indicator data of the picture changed so as to become not
the I picture, but the P picture after the compression
into Equation 12-2 in place of Equation 12-1, thereby to
correct the value of the real difficulty data D_j.
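
The correction can be sketched as below. Equation 12-1 to
Equation 12-3 are not reproduced in this part of the text, so
the sketch assumes, purely for illustration, a proportional form
in which the difficulty is a coefficient multiplied by one
indicator (intra AC for an I picture, ME residual for a P or B
picture). All names and numbers are hypothetical.

```python
def approximate_difficulty(indicators, picture_type, coeff):
    """Stand-in for Equation 12-1 (I), 12-2 (P), and 12-3 (B).

    ASSUMPTION: difficulty = proportional coefficient x indicator.
    The actual equations are given in the third embodiment.
    """
    if picture_type == "I":
        return coeff["I"] * indicators["intra_ac"]
    return coeff[picture_type] * indicators["me_residual"]


def correct_rcgop(provisional, stored_indicators, old_types, new_types, coeff):
    """Recalculate only the pictures whose picture type was changed."""
    corrected = list(provisional)
    for k, (old, new) in enumerate(zip(old_types, new_types)):
        if old != new:
            corrected[k] = approximate_difficulty(stored_indicators[k], new, coeff)
    return corrected


# Illustrative values only.
coeff = {"I": 2.0, "P": 1.0, "B": 0.5}
stored = [
    {"intra_ac": 10.0, "me_residual": 4.0},
    {"intra_ac": 20.0, "me_residual": 30.0},
    {"intra_ac": 12.0, "me_residual": 6.0},
]
old_types = ["B", "P", "I"]   # provisional assumption (no scene change)
new_types = ["B", "I", "P"]   # after the scene change is detected
provisional = [approximate_difficulty(s, t, coeff)
               for s, t in zip(stored, old_types)]
corrected = correct_rcgop(provisional, stored, old_types, new_types, coeff)
```

Because the indicator data is stored for every picture, only the
changed pictures need to be recalculated, which keeps the work
after scene change detection small.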

The DCT circuit 166 of the encoder 18 performs DCT 
processing similar to that in the first embodiment to the 
third embodiment. 

The quantization circuit 168 quantizes the
DCT-processed video data by the quantization value
adjusted by the quantization control circuit 180 based on
the target amount of data T_j and calculates the average
value of the quantization values Q_j.

The variable length coding circuit 170 performs
variable length coding similar to that in the first
embodiment to the third embodiment, produces the compressed
video data VOUT of an amount of data near the target
amount of data T_j, and outputs the same via the buffer
memory 182.

When the encoder 18 terminates the compression and
coding of the j-th picture, the host computer 20
calculates the global complexity as shown in Equation 10-1
to Equation 10-3 based on the average value of the
quantization values Q_j and the amount of data of the
compressed and coded j-th picture.

Further, the host computer 20 updates and optimizes
the proportional coefficients ε_I, ε_P, and ε_B as shown in
Equation 11-1 to Equation 11-3 by the calculated global
complexity. In the same way as in the third embodiment, the
updated proportional coefficients ε_I, ε_P, and ε_B are
reflected in the transformation equations at the time of
compression and coding of the next picture (Equation 12-1
to Equation 12-3).
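
The feedback from the encoding result to the proportional
coefficients can be sketched as follows. Equation 10-1 to
Equation 10-3 and Equation 11-1 to Equation 11-3 are not
reproduced here. The sketch assumes that the global complexity
is the product of the generated amount of data and the average
quantization value, as in the MPEG-2 Test Model 5, and assumes a
simple smoothed ratio update of the coefficient. Both forms, and
all names and numbers, are assumptions for illustration only.

```python
def global_complexity(generated_bits: float, avg_quantizer: float) -> float:
    """ASSUMED form: amount of data of the coded picture multiplied
    by the average value of the quantization values."""
    return generated_bits * avg_quantizer


def update_coefficient(complexity: float, indicator: float,
                       old_coeff: float, smoothing: float = 0.5) -> float:
    """ASSUMED update rule: move the coefficient toward the ratio of
    the measured global complexity to the indicator data.

    The actual rule of Equation 11-1 to Equation 11-3 may differ.
    """
    if indicator <= 0:
        return old_coeff  # no usable indicator, keep the old coefficient
    observed = complexity / indicator
    return (1.0 - smoothing) * old_coeff + smoothing * observed


# Illustrative values only.
x_j = global_complexity(generated_bits=40000.0, avg_quantizer=8.0)
new_coeff = update_coefficient(x_j, indicator=1600.0, old_coeff=180.0)
```

The updated coefficient would then be used when approximating
the real difficulty data of the next picture of the same
picture type.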

The content of the processing of the host computer 
20 in the fourth embodiment will be further explained 
referring to Fig. 16. 

Figure 16 is a flowchart of the content of the 
processing of the host computer 20 in the video data 
compression apparatus 2 in the fourth embodiment. 

As shown in Fig. 16, the processing of the host 
computer 20 in the fourth embodiment is divided into a 
first stage (S400) and a second stage (S420). At the 
first stage, processing for predicting the real 
difficulty data D_j is carried out by assuming that there
is no scene change and there is no change in the picture 
type sequence, and at the second stage, processing for 
correcting the value of the real difficulty data is 
carried out where a scene change occurs and the picture 
type sequence is changed. 

The first stage (S400; S402 to S412) is the
processing for predicting the real difficulty data
where there is no scene change. At step 402 (S402) of the
first stage, the host computer 20 fetches the indicator
data (statistical amount) such as the ME residual or
intra AC of the (j+L)-th picture from the encoder control
unit 22 or the motion detector 14 and stores the same.

At step 404 (S404), the host computer 20 decides
whether or not the [j+L (j+N)]-th picture is to be
compressed and coded to a B picture. Where the (j+L)-th
picture is to be compressed and coded to a B picture, the
operation routine proceeds to the processing of S406,
while where it is not to be compressed and coded to a B
picture, the operation routine proceeds to the processing
of S408.

At step 406 (S406), the host computer 20 predicts 
that the (j+L)-th picture is to be compressed and coded 
to a B picture and approximates and calculates the real 
difficulty data by Equation 12-3. 

At step 408 (S408), the host computer 20 decides
whether or not the number (interval) from the picture to
be compressed and coded to the I picture in the previous
RCGOP to the (j+L)-th picture of the current RCGOP is N.
Where the interval is N, the operation routine proceeds
to the processing of S412, while where it is not N, the
operation routine proceeds to the processing of S410.

At step 410 (S410), the host computer 20 predicts
that the (j+L)-th picture is to be compressed and coded
to a P picture and approximates and calculates the real
difficulty data by Equation 12-2.

At step 412 (S412), the host computer 20 predicts
that the (j+L)-th picture is to be compressed and coded
to an I picture and approximates and calculates the real
difficulty data D_j by Equation 12-1.

The second stage (S420; S422 to S434) is the
processing for correcting the real difficulty data D_j
predicted at the first stage. At step 422 (S422) of the
second stage, the host computer 20 decides whether or not
a new RCGOP starts. Where it does not start, the
operation routine proceeds to the processing of S430,
while where it starts, the operation routine proceeds to
the processing of S424.

At step 424 (S424), the host computer 20 decides 
whether or not the picture type sequence is changed so 
that the position of the I picture changes. Where the 
picture type sequence is changed so that the position of 
the I picture changes, the operation routine proceeds to 
the processing of S426, while where it is not changed, 
the operation routine proceeds to the processing of S430. 

At step 426 (S426), the host computer 20
approximates and calculates the real difficulty data D_j
by Equation 12-1 for the picture newly compressed and
coded to the I picture.

At step 428 (S428), the host computer 20
approximates and calculates the real difficulty data
by Equation 12-2 for the picture newly compressed and
coded to the P picture.

At step 430 (S430), the host computer 20 calculates
the target amount of data T_j with respect to the j-th
picture by Equation 1 or Equation 13-1 to Equation 13-3
and sets the same in the quantization control circuit 180
of the encoder 18 (Fig. 5).

At step 432 (S432), the encoder 18 compresses and
codes the j-th picture based on the target amount of data
T_j set in the quantization control circuit 180.

At step 434 (S434), the host computer 20 increments
the numerical value j.

Note that, in the fourth embodiment, the host
computer 20 of the video data compression apparatus 2
performs the processing for correcting only the real
difficulty data of pictures whose picture type after
compression is changed where there is a scene change, but
when there is a margin in the processing time, it can be
modified so that the real difficulty data of all pictures
are calculated after the picture type sequence is decided.

Further, modifications similar to those in the first
embodiment to the third embodiment are also possible for
the operation of the video data compression apparatus 2
shown in the fourth embodiment.

Further, the contents of the processing of the video
data compression apparatuses 1 and 2 respectively
explained in the first embodiment to the third embodiment
can be combined so far as they do not contradict each
other.

As mentioned above, according to the operation of
the video data compression apparatus 2 in the fourth
embodiment, the same effect as that by the operation of
the video data compression apparatus 2 shown in the second
embodiment and the third embodiment is obtained, a
more accurate target amount of data than that in
those embodiments can be calculated, and in addition the
quality of the compressed video data of the scene change
part is not lowered.

As explained above, by the video data compression
apparatus according to the present invention and the
method of the same, audio and/or video data can be
compressed and coded to a predetermined amount of data or
less by a method other than the two pass encoding.

Further, by the video data compression apparatus
according to the present invention and the method of the
same, the video data can be compressed and coded in
almost real time and in addition a high quality image can
be obtained after expansion and decoding.

Further, by the video data compression apparatus
according to the present invention and the method of the
same, the compression rate is adjusted by estimating the
amount of data after the compression and coding and the
compression and coding processing can be carried out by a
method other than two pass encoding.



